Overview

Dataset statistics

Number of variables21
Number of observations21312
Missing cells31352
Missing cells (%)7.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.5 MiB
Average record size in memory220.7 B

Variable types

NUM14
CAT3
BOOL3
DATE1

Reproduction

Analysis started2020-05-05 08:24:12.110113
Analysis finished2020-05-05 08:25:34.054421
Versionpandas-profiling v2.5.0
Command linepandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configurationconfig.yaml
producto has a high cardinality: 72 distinct values High cardinality
month is highly correlated with quarter and 1 other fieldsHigh Correlation
quarter is highly correlated with month and 1 other fieldsHigh Correlation
weekofyear is highly correlated with quarter and 1 other fieldsHigh Correlation
udsstock has 8109 (38.0%) missing values Missing
udsventa has 5350 (25.1%) missing values Missing
udsprevisionempresa has 4948 (23.2%) missing values Missing
roll4wd_udsventa has 4164 (19.5%) missing values Missing
meanwd_udsventa has 1806 (8.5%) missing values Missing
roll4wd_udsstock has 2154 (10.1%) missing values Missing
roll4wd_udsprevisionempresa has 3939 (18.5%) missing values Missing
meanwd_udsprevisionempresa has 882 (4.1%) missing values Missing
weekday has 3024 (14.2%) zeros Zeros
sin_weekday has 3024 (14.2%) zeros Zeros
meanwd_udsprevisionempresa has 252 (1.2%) zeros Zeros

Variables

fecha
Date

Distinct count296
Unique (%)1.4%
Missing0
Missing (%)0.0%
Memory size333.0 KiB
Minimum2019-06-05 00:00:00
Maximum2020-03-26 00:00:00
Histogram

producto
Categorical

HIGH CARDINALITY
UNIFORM
Distinct count72
Unique (%)0.3%
Missing0
Missing (%)0.0%
Memory size333.0 KiB
64
 
296
61
 
296
50
 
296
48
 
296
12
 
296
Other values (67)
19832
ValueCountFrequency (%) 
64 296 1.4%
 
61 296 1.4%
 
50 296 1.4%
 
48 296 1.4%
 
12 296 1.4%
 
31 296 1.4%
 
40 296 1.4%
 
28 296 1.4%
 
89 296 1.4%
 
3 296 1.4%
 
Other values (62) 18352 86.1%
 

Length

Max length2
Mean length1.902777778
Min length1
ValueCountFrequency (%) 
Decimal_Number 10 100.0%
 
ValueCountFrequency (%) 
Common 10 100.0%
 
ValueCountFrequency (%) 
ASCII 10 100.0%
 

udsstock
Real number (ℝ≥0)

MISSING
Distinct count2525
Unique (%)19.1%
Missing8109
Missing (%)38.0%
Infinite0
Infinite (%)0.0%
Mean1464.94167992123
Minimum1.0
Maximum64729.0
Zeros0
Zeros (%)0.0%
Memory size333.0 KiB

Quantile statistics

Minimum1
5-th percentile30
Q1416
median940
Q31757
95-th percentile4231
Maximum64729
Range64728
Interquartile range (IQR)1341

Descriptive statistics

Standard deviation2505.118828
Coefficient of variation (CV)1.710046798
Kurtosis176.5901822
Mean1464.94168
Median Absolute Deviation (MAD)1150.580857
Skewness10.55962398
Sum19341625
Variance6275620.344
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
9 62 0.3%
 
19 60 0.3%
 
39 56 0.3%
 
3 54 0.3%
 
969 53 0.2%
 
6 52 0.2%
 
930 51 0.2%
 
26 50 0.2%
 
1 45 0.2%
 
736 44 0.2%
 
Other values (2515) 12676 59.5%
 
(Missing) 8109 38.0%
 
ValueCountFrequency (%) 
1 45 0.2%
 
3 54 0.3%
 
5 19 0.1%
 
6 52 0.2%
 
8 20 0.1%
 
ValueCountFrequency (%) 
64729 1 < 0.1%
 
57022 2 < 0.1%
 
56215 1 < 0.1%
 
51299 1 < 0.1%
 
50827 1 < 0.1%
 

udsventa
Real number (ℝ≥0)

MISSING
Distinct count2484
Unique (%)15.6%
Missing5350
Missing (%)25.1%
Infinite0
Infinite (%)0.0%
Mean1127.530447312367
Minimum0.0
Maximum41564.0
Zeros25
Zeros (%)0.1%
Memory size333.0 KiB

Quantile statistics

Minimum0
5-th percentile59
Q1339
median669
Q31364
95-th percentile3542
Maximum41564
Range41564
Interquartile range (IQR)1025

Descriptive statistics

Standard deviation1608.355983
Coefficient of variation (CV)1.426441288
Kurtosis144.0787428
Mean1127.530447
Median Absolute Deviation (MAD)889.8335557
Skewness8.525744921
Sum17997641
Variance2586808.968
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
472 162 0.8%
 
944 109 0.5%
 
2361 99 0.5%
 
2755 82 0.4%
 
1416 79 0.4%
 
787 74 0.3%
 
3306 73 0.3%
 
590 72 0.3%
 
1180 71 0.3%
 
393 71 0.3%
 
Other values (2474) 15070 70.7%
 
(Missing) 5350 25.1%
 
ValueCountFrequency (%) 
0 25 0.1%
 
1 24 0.1%
 
2 35 0.2%
 
3 26 0.1%
 
4 31 0.1%
 
ValueCountFrequency (%) 
41564 1 < 0.1%
 
39556 1 < 0.1%
 
39202 1 < 0.1%
 
37313 1 < 0.1%
 
36211 1 < 0.1%
 

udsprevisionempresa
Real number (ℝ≥0)

MISSING
Distinct count7052
Unique (%)43.1%
Missing4948
Missing (%)23.2%
Infinite0
Infinite (%)0.0%
Mean4781.039721339525
Minimum0.0
Maximum102304.0
Zeros93
Zeros (%)0.4%
Memory size333.0 KiB

Quantile statistics

Minimum0
5-th percentile98
Q11106.75
median2639
Q35903
95-th percentile16531
Maximum102304
Range102304
Interquartile range (IQR)4796.25

Descriptive statistics

Standard deviation6509.194795
Coefficient of variation (CV)1.361460095
Kurtosis26.59691195
Mean4781.039721
Median Absolute Deviation (MAD)4087.909794
Skewness3.941289985
Sum78236934
Variance42369616.88
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0 93 0.4%
 
2833 33 0.2%
 
1 31 0.1%
 
2 30 0.1%
 
4 29 0.1%
 
1700 25 0.1%
 
3306 24 0.1%
 
9918 23 0.1%
 
6612 23 0.1%
 
4250 22 0.1%
 
Other values (7042) 16031 75.2%
 
(Missing) 4948 23.2%
 
ValueCountFrequency (%) 
0 93 0.4%
 
1 31 0.1%
 
2 30 0.1%
 
3 14 0.1%
 
4 29 0.1%
 
ValueCountFrequency (%) 
102304 1 < 0.1%
 
98998 1 < 0.1%
 
98716 1 < 0.1%
 
81475 1 < 0.1%
 
76489 1 < 0.1%
 

promo
Boolean

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size333.0 KiB
0
16480
1
4832
ValueCountFrequency (%) 
0 16480 77.3%
 
1 4832 22.7%
 

festivo
Boolean

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size333.0 KiB
0
20736
1
 
576
ValueCountFrequency (%) 
0 20736 97.3%
 
1 576 2.7%
 

weekday
Real number (ℝ≥0)

ZEROS
Distinct count7
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.9966216216216215
Minimum0
Maximum6
Zeros3024
Zeros (%)14.2%
Memory size333.0 KiB

Quantile statistics

Minimum0
5-th percentile0
Q11
median3
Q35
95-th percentile6
Maximum6
Range6
Interquartile range (IQR)4

Descriptive statistics

Standard deviation1.994122996
Coefficient of variation (CV)0.665457054
Kurtosis-1.240859336
Mean2.996621622
Median Absolute Deviation (MAD)1.706560446
Skewness0.004656882354
Sum63864
Variance3.976526524
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0. 0.5 5.5 6. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
3 3096 14.5%
 
2 3096 14.5%
 
6 3024 14.2%
 
5 3024 14.2%
 
4 3024 14.2%
 
1 3024 14.2%
 
0 3024 14.2%
 
ValueCountFrequency (%) 
0 3024 14.2%
 
1 3024 14.2%
 
2 3096 14.5%
 
3 3096 14.5%
 
4 3024 14.2%
 
ValueCountFrequency (%) 
6 3024 14.2%
 
5 3024 14.2%
 
4 3024 14.2%
 
3 3096 14.5%
 
2 3096 14.5%
 

quarter
Categorical

HIGH CORRELATION
Distinct count4
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size333.0 KiB
4
6624
3
6624
1
6192
2
1872
ValueCountFrequency (%) 
4 6624 31.1%
 
3 6624 31.1%
 
1 6192 29.1%
 
2 1872 8.8%
 

Length

Max length1
Mean length1
Min length1
ValueCountFrequency (%) 
Decimal_Number 4 100.0%
 
ValueCountFrequency (%) 
Common 4 100.0%
 
ValueCountFrequency (%) 
ASCII 4 100.0%
 

month
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count10
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.993243243243243
Minimum1
Maximum12
Zeros0
Zeros (%)0.0%
Memory size333.0 KiB

Quantile statistics

Minimum1
5-th percentile1
Q13
median8
Q310
95-th percentile12
Maximum12
Range11
Interquartile range (IQR)7

Descriptive statistics

Standard deviation3.661418958
Coefficient of variation (CV)0.523565223
Kurtosis-1.215477346
Mean6.993243243
Median Absolute Deviation (MAD)3.109751644
Skewness-0.3460820536
Sum149040
Variance13.40598879
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 1. 1.5 2.5 6.5 11.5 12. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
12 2232 10.5%
 
10 2232 10.5%
 
8 2232 10.5%
 
7 2232 10.5%
 
1 2232 10.5%
 
11 2160 10.1%
 
9 2160 10.1%
 
2 2088 9.8%
 
6 1872 8.8%
 
3 1872 8.8%
 
ValueCountFrequency (%) 
1 2232 10.5%
 
2 2088 9.8%
 
3 1872 8.8%
 
6 1872 8.8%
 
7 2232 10.5%
 
ValueCountFrequency (%) 
12 2232 10.5%
 
11 2160 10.1%
 
10 2232 10.5%
 
9 2160 10.1%
 
8 2232 10.5%
 

weekofyear
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count43
Unique (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean28.469594594594593
Minimum1
Maximum52
Zeros0
Zeros (%)0.0%
Memory size333.0 KiB

Quantile statistics

Minimum1
5-th percentile3
Q111
median31
Q342
95-th percentile50
Maximum52
Range51
Interquartile range (IQR)31

Descriptive statistics

Standard deviation15.95001268
Coefficient of variation (CV)0.560247271
Kurtosis-1.228771251
Mean28.46959459
Median Absolute Deviation (MAD)13.65613587
Skewness-0.3250216912
Sum606744
Variance254.4029045
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 1. 1.5 12.5 23.5 51.5 52. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
47 504 2.4%
 
51 504 2.4%
 
7 504 2.4%
 
38 504 2.4%
 
6 504 2.4%
 
37 504 2.4%
 
5 504 2.4%
 
52 504 2.4%
 
36 504 2.4%
 
4 504 2.4%
 
Other values (33) 16272 76.4%
 
ValueCountFrequency (%) 
1 504 2.4%
 
2 504 2.4%
 
3 504 2.4%
 
4 504 2.4%
 
5 504 2.4%
 
ValueCountFrequency (%) 
52 504 2.4%
 
51 504 2.4%
 
50 504 2.4%
 
49 504 2.4%
 
48 504 2.4%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size187.3 KiB
True
17712
False
3600
ValueCountFrequency (%) 
True 17712 83.1%
 
False 3600 16.9%
 

sin_weekday
Real number (ℝ)

ZEROS
Distinct count7
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.004759498821957349
Minimum-0.9749279121818236
Maximum0.9749279121818236
Zeros3024
Zeros (%)14.2%
Memory size333.0 KiB

Quantile statistics

Minimum-0.9749279122
5-th percentile-0.9749279122
Q1-0.7818314825
median0
Q30.7818314825
95-th percentile0.9749279122
Maximum0.9749279122
Range1.949855824
Interquartile range (IQR)1.563662965

Descriptive statistics

Standard deviation0.7074387216
Coefficient of variation (CV)148.6372301
Kurtosis-1.500182941
Mean0.004759498822
Median Absolute Deviation (MAD)0.6270716718
Skewness-0.01056263076
Sum101.4344389
Variance0.5004695448
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[-0.97492791 -0.60785761 -0.21694187 0.21694187 0.60785761 0.8783797 0.97492791], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0.4338837391 3096 14.5%
 
0.9749279122 3096 14.5%
 
-0.9749279122 3024 14.2%
 
-0.7818314825 3024 14.2%
 
-0.4338837391 3024 14.2%
 
0.7818314825 3024 14.2%
 
0 3024 14.2%
 
ValueCountFrequency (%) 
-0.9749279122 3024 14.2%
 
-0.7818314825 3024 14.2%
 
-0.4338837391 3024 14.2%
 
0 3024 14.2%
 
0.4338837391 3096 14.5%
 
ValueCountFrequency (%) 
0.9749279122 3096 14.5%
 
0.7818314825 3024 14.2%
 
0.4338837391 3096 14.5%
 
0 3024 14.2%
 
-0.4338837391 3024 14.2%
 

cos_weekday
Real number (ℝ)

Distinct count7
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-0.003795573654928178
Minimum-0.9009688679024191
Maximum1.0
Zeros0
Zeros (%)0.0%
Memory size333.0 KiB

Quantile statistics

Minimum-0.9009688679
5-th percentile-0.9009688679
Q1-0.9009688679
median-0.222520934
Q30.6234898019
95-th percentile1
Maximum1
Range1.900968868
Interquartile range (IQR)1.52445867

Descriptive statistics

Standard deviation0.7067816624
Coefficient of variation (CV)-186.2120793
Kurtosis-1.498346475
Mean-0.003795573655
Median Absolute Deviation (MAD)0.6408877408
Skewness0.0090077723
Sum-80.89126573
Variance0.4995403184
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[-0.90096887 -0.90096887 1. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
-0.222520934 3096 14.5%
 
-0.9009688679 3096 14.5%
 
-0.9009688679 3024 14.2%
 
-0.222520934 3024 14.2%
 
0.6234898019 3024 14.2%
 
0.6234898019 3024 14.2%
 
1 3024 14.2%
 
ValueCountFrequency (%) 
-0.9009688679 3024 14.2%
 
-0.9009688679 3096 14.5%
 
-0.222520934 3024 14.2%
 
-0.222520934 3096 14.5%
 
0.6234898019 3024 14.2%
 
ValueCountFrequency (%) 
1 3024 14.2%
 
0.6234898019 3024 14.2%
 
0.6234898019 3024 14.2%
 
-0.222520934 3096 14.5%
 
-0.222520934 3024 14.2%
 

stockMissingType
Categorical

Distinct count3
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size333.0 KiB
0
13203
2
7101
1
 
1008
ValueCountFrequency (%) 
0 13203 62.0%
 
2 7101 33.3%
 
1 1008 4.7%
 

Length

Max length3
Mean length3
Min length3
ValueCountFrequency (%) 
Decimal_Number 3 75.0%
 
Other_Punctuation 1 25.0%
 
ValueCountFrequency (%) 
Common 4 100.0%
 
ValueCountFrequency (%) 
ASCII 4 100.0%
 

roll4wd_udsventa
Real number (ℝ≥0)

MISSING
Distinct count11281
Unique (%)65.8%
Missing4164
Missing (%)19.5%
Infinite0
Infinite (%)0.0%
Mean1091.6569891532538
Minimum0.0
Maximum28339.0
Zeros26
Zeros (%)0.1%
Memory size333.0 KiB

Quantile statistics

Minimum0
5-th percentile54
Q1360
median676.3125
Q31318.71875
95-th percentile3367.825
Maximum28339
Range28339
Interquartile range (IQR)958.71875

Descriptive statistics

Standard deviation1349.899337
Coefficient of variation (CV)1.236559973
Kurtosis43.96518431
Mean1091.656989
Median Absolute Deviation (MAD)826.6935037
Skewness4.78625492
Sum18719734.05
Variance1822228.22
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
472 75 0.4%
 
7 55 0.3%
 
2 36 0.2%
 
944 35 0.2%
 
6 30 0.1%
 
4 26 0.1%
 
1 26 0.1%
 
0 26 0.1%
 
885 18 0.1%
 
708 18 0.1%
 
Other values (11271) 16803 78.8%
 
(Missing) 4164 19.5%
 
ValueCountFrequency (%) 
0 26 0.1%
 
1 26 0.1%
 
1.142857143 1 < 0.1%
 
1.428571429 1 < 0.1%
 
1.6 1 < 0.1%
 
ValueCountFrequency (%) 
28339 1 < 0.1%
 
22316.75 1 < 0.1%
 
20309 1 < 0.1%
 
19187.625 1 < 0.1%
 
19128.5 1 < 0.1%
 

meanwd_udsventa
Real number (ℝ≥0)

MISSING
Distinct count458
Unique (%)2.3%
Missing1806
Missing (%)8.5%
Infinite0
Infinite (%)0.0%
Mean1048.0396869596257
Minimum1.0
Maximum9651.846153846154
Zeros0
Zeros (%)0.0%
Memory size333.0 KiB

Quantile statistics

Minimum1
5-th percentile29.04761905
Q1347.6756757
median698.8461538
Q31241.131579
95-th percentile3304.3
Maximum9651.846154
Range9650.846154
Interquartile range (IQR)893.4559033

Descriptive statistics

Standard deviation1170.45711
Coefficient of variation (CV)1.116806095
Kurtosis12.02088122
Mean1048.039687
Median Absolute Deviation (MAD)771.8843392
Skewness2.894202013
Sum20443062.13
Variance1369969.847
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
8 84 0.4%
 
1 84 0.4%
 
787 84 0.4%
 
379.425 43 0.2%
 
859.275 43 0.2%
 
362.8529412 43 0.2%
 
381.075 43 0.2%
 
1009 43 0.2%
 
3090.95 43 0.2%
 
285.675 43 0.2%
 
Other values (448) 18953 88.9%
 
(Missing) 1806 8.5%
 
ValueCountFrequency (%) 
1 84 0.4%
 
1.5 42 0.2%
 
2 42 0.2%
 
4 42 0.2%
 
4.333333333 42 0.2%
 
ValueCountFrequency (%) 
9651.846154 43 0.2%
 
7668.605263 42 0.2%
 
7015.538462 42 0.2%
 
6539.25641 42 0.2%
 
6464.5 43 0.2%
 

roll4wd_udsstock
Real number (ℝ≥0)

MISSING
Distinct count12022
Unique (%)62.8%
Missing2154
Missing (%)10.1%
Infinite0
Infinite (%)0.0%
Mean1442.005518955155
Minimum1.0
Maximum57022.0
Zeros0
Zeros (%)0.0%
Memory size333.0 KiB

Quantile statistics

Minimum1
5-th percentile44.12142857
Q1469.5
median952.375
Q31710.28125
95-th percentile3976.7375
Maximum57022
Range57021
Interquartile range (IQR)1240.78125

Descriptive statistics

Standard deviation2195.892139
Coefficient of variation (CV)1.522804255
Kurtosis122.3428235
Mean1442.005519
Median Absolute Deviation (MAD)1081.779656
Skewness8.396315353
Sum27625941.73
Variance4821942.287
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
6 50 0.2%
 
9 46 0.2%
 
407 37 0.2%
 
1 32 0.2%
 
19 27 0.1%
 
930 27 0.1%
 
710 26 0.1%
 
116 25 0.1%
 
26 25 0.1%
 
3 24 0.1%
 
Other values (12012) 18839 88.4%
 
(Missing) 2154 10.1%
 
ValueCountFrequency (%) 
1 32 0.2%
 
2.25 1 < 0.1%
 
3 24 0.1%
 
3.75 2 < 0.1%
 
4.5 3 < 0.1%
 
ValueCountFrequency (%) 
57022 1 < 0.1%
 
51299 2 < 0.1%
 
43633.75 1 < 0.1%
 
42400.25 1 < 0.1%
 
40756 4 < 0.1%
 

meanwd_udsstock
Real number (ℝ≥0)

Distinct count502
Unique (%)2.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1436.7039646740225
Minimum27.714285714285715
Maximum15575.92857142857
Zeros0
Zeros (%)0.0%
Memory size333.0 KiB

Quantile statistics

Minimum27.71428571
5-th percentile92.22580645
Q1545.5833333
median1019.272727
Q31821.266667
95-th percentile4069.222222
Maximum15575.92857
Range15548.21429
Interquartile range (IQR)1275.683333

Descriptive statistics

Standard deviation1574.786742
Coefficient of variation (CV)1.096110807
Kurtosis19.8513319
Mean1436.703965
Median Absolute Deviation (MAD)982.6044583
Skewness3.59022737
Sum30619034.9
Variance2479953.281
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 27.71428571 53.96394231 58.51388889 69.98333333 70.42333333 ... 6529.45512821 9065.97916667 9535.51666667 10341.52604167 15575.92857143], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
1152.333333 85 0.4%
 
756 84 0.4%
 
1961.09375 43 0.2%
 
968.5806452 43 0.2%
 
3783.285714 43 0.2%
 
5748.56 43 0.2%
 
3859.789474 43 0.2%
 
1019.272727 43 0.2%
 
1321.291667 43 0.2%
 
715.1290323 43 0.2%
 
Other values (492) 20799 97.6%
 
ValueCountFrequency (%) 
27.71428571 42 0.2%
 
33.92307692 43 0.2%
 
42.71428571 43 0.2%
 
45.92857143 43 0.2%
 
52.3125 42 0.2%
 
ValueCountFrequency (%) 
15575.92857 42 0.2%
 
11012.21875 43 0.2%
 
9670.833333 42 0.2%
 
9400.2 42 0.2%
 
9193.291667 42 0.2%
 

roll4wd_udsprevisionempresa
Real number (ℝ≥0)

MISSING
Distinct count15468
Unique (%)89.0%
Missing3939
Missing (%)18.5%
Infinite0
Infinite (%)0.0%
Mean4898.456518324823
Minimum0.0
Maximum102304.0
Zeros122
Zeros (%)0.6%
Memory size333.0 KiB

Quantile statistics

Minimum0
5-th percentile67.825
Q11198
median2770.75
Q36009.5
95-th percentile16698.025
Maximum102304
Range102304
Interquartile range (IQR)4811.5

Descriptive statistics

Standard deviation6559.633452
Coefficient of variation (CV)1.339122523
Kurtosis25.04161269
Mean4898.456518
Median Absolute Deviation (MAD)4128.878475
Skewness3.856323727
Sum85100885.09
Variance43028791.03
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0 122 0.6%
 
1 22 0.1%
 
2 16 0.1%
 
5 13 0.1%
 
4 10 < 0.1%
 
6 9 < 0.1%
 
29 8 < 0.1%
 
2.5 7 < 0.1%
 
147 7 < 0.1%
 
7 7 < 0.1%
 
Other values (15458) 17152 80.5%
 
(Missing) 3939 18.5%
 
ValueCountFrequency (%) 
0 122 0.6%
 
0.1428571429 1 < 0.1%
 
0.25 3 < 0.1%
 
0.4 1 < 0.1%
 
0.4285714286 1 < 0.1%
 
ValueCountFrequency (%) 
102304 1 < 0.1%
 
98716 1 < 0.1%
 
85217.5 1 < 0.1%
 
82986 1 < 0.1%
 
81475 1 < 0.1%
 

meanwd_udsprevisionempresa
Real number (ℝ≥0)

MISSING
ZEROS
Distinct count477
Unique (%)2.3%
Missing882
Missing (%)4.1%
Infinite0
Infinite (%)0.0%
Mean4222.22435998539
Minimum0.0
Maximum37273.0
Zeros252
Zeros (%)1.2%
Memory size333.0 KiB

Quantile statistics

Minimum0
5-th percentile23.9
Q11175.210526
median2710.674426
Q35724.236842
95-th percentile13489.91892
Maximum37273
Range37273
Interquartile range (IQR)4549.026316

Descriptive statistics

Standard deviation4816.679958
Coefficient of variation (CV)1.140792044
Kurtosis8.587230732
Mean4222.22436
Median Absolute Deviation (MAD)3391.085812
Skewness2.407909484
Sum86260043.67
Variance23200405.82
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0 252 1.2%
 
147 84 0.4%
 
5862.794872 43 0.2%
 
1720.076923 43 0.2%
 
1724.578947 43 0.2%
 
4871.052632 43 0.2%
 
1569.974359 43 0.2%
 
2332.973684 43 0.2%
 
2651.421053 43 0.2%
 
12004.2973 43 0.2%
 
Other values (467) 19750 92.7%
 
(Missing) 882 4.1%
 
ValueCountFrequency (%) 
0 252 1.2%
 
2.5 42 0.2%
 
5.666666667 42 0.2%
 
5.888888889 42 0.2%
 
6.2 42 0.2%
 
ValueCountFrequency (%) 
37273 43 0.2%
 
33120.13158 42 0.2%
 
23130.45946 43 0.2%
 
23025.10256 43 0.2%
 
22957.05128 42 0.2%
 

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Missing values

Sample

First rows

fechaproductoudsstockudsventaudsprevisionempresapromofestivoweekdayquartermonthweekofyearworking_daysin_weekdaycos_weekdaystockMissingTyperoll4wd_udsventameanwd_udsventaroll4wd_udsstockmeanwd_udsstockroll4wd_udsprevisionempresameanwd_udsprevisionempresa
02019-06-05118275.02833.0102304.00.00.022623True0.974928-0.2225210.02833.03405.13157918275.011012.218750102304.037273.000000
12019-06-05102267.01874.036888.01.00.022623True0.974928-0.2225210.01874.02440.7631582267.01822.16129036888.010102.540541
22019-06-05112506.02755.031078.00.00.022623True0.974928-0.2225210.02755.02323.1351352506.03987.66666731078.010973.815789
32019-06-05121279.01161.033661.00.00.022623True0.974928-0.2225210.01161.01143.3684211279.01605.74193533661.05724.236842
42019-06-05132493.01603.032119.01.00.022623True0.974928-0.2225210.01603.01874.4473682493.03113.84375032119.08500.421053
52019-06-05141332.01626.030253.00.00.022623True0.974928-0.2225210.01626.01801.4358971332.01157.80000030253.09599.657895
62019-06-05151447.0472.016814.00.00.022623True0.974928-0.2225210.0472.0781.7500001447.01580.92857116814.03691.052632
72019-06-05161154.01446.027856.01.00.022623True0.974928-0.2225210.01446.01403.8387101154.01348.04166727856.07479.529412
82019-06-05173755.02270.038503.00.00.022623True0.974928-0.2225210.02270.02352.8717953755.02990.25000038503.010634.894737
92019-06-05182962.01794.039619.00.00.022623True0.974928-0.2225210.01794.01875.2820512962.01152.33333339619.09314.973684

Last rows

fechaproductoudsstockudsventaudsprevisionempresapromofestivoweekdayquartermonthweekofyearworking_daysin_weekdaycos_weekdaystockMissingTyperoll4wd_udsventameanwd_udsventaroll4wd_udsstockmeanwd_udsstockroll4wd_udsprevisionempresameanwd_udsprevisionempresa
213022020-03-2684NaNNaN358.00.00.031313True0.433884-0.9009692.0508.142857285.000000114.00225.2400001623.8751240.307692
213032020-03-2687868.0NaN273.01.00.031313True0.433884-0.9009690.0682.857143329.3500001066.00356.6206901376.2501684.102564
213042020-03-268857.0NaN231.00.00.031313True0.433884-0.9009690.0441.857143259.52500072.00103.1333331161.0001408.897436
213052020-03-268921.0NaN213.00.00.031313True0.433884-0.9009690.0529.714286305.00000043.5045.9285711071.0001453.512821
213062020-03-269930.0NaN1574.00.00.031313True0.433884-0.9009690.05622.8571434046.6153854554.002725.0000007123.8759005.923077
213072020-03-269138.0NaN272.00.00.031313True0.433884-0.9009690.01284.714286415.35000031.2597.1851851219.1251593.512821
213082020-03-2694947.0NaN229.00.00.031313True0.433884-0.9009690.0548.142857249.850000574.25543.5000001145.7501179.225000
213092020-03-2696465.0NaN266.01.00.031313True0.433884-0.9009690.0460.571429285.675000737.25491.9310341327.1251399.375000
213102020-03-2697395.0NaN235.00.00.031313True0.433884-0.9009690.0151.857143346.375000434.00310.2580651177.7501569.974359
213112020-03-269895.0NaN243.00.00.031313True0.433884-0.9009690.0371.857143266.900000559.25428.6666671237.8751306.846154